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ABSTRACT 

Basic taste qualities like sour, salty, sweet, bitter 
and umami serve specific functions in identifying 
food components found in the diet of humans and 
animals, and are recognized by proteins in the oral 
cavity. Recognition of bitter taste and aversion to it 
are thought to protect the organism against the in- 
gestion of poisonous food compounds, which are 
often bitter. Interestingly, bitter taste receptors are 
expressed not only in the mouth but also in extraoral 
tissues, such as the gastrointestinal tract, indicating 
that they may play a role in digestive and metabolic 
processes. BitterDB database, available at http:// 
bitterdb.agri.huji.ac.il/bitterdb/, includes over 550 
compounds that were reported to taste bitter to 
humans. The compounds can be searched by 
name, chemical structure, similarity to other bitter 
compounds, association with a particular human 
bitter taste receptor, and so on. The database also 
contains information on mutations in bitter taste re- 
ceptors that were shown to influence receptor acti- 
vation by bitter compounds. The aim of BitterDB is 
to facilitate studying the chemical features asso- 
ciated with bitterness. These studies may contribute 
to predicting bitterness of unknown compounds, 
predicting ligands for bitter receptors from different 
species and rational design of bitterness modulators. 

INTRODUCTION 

The bitterness recognition puzzle 

The mammalian sense of taste is mediated by receptor pro- 
teins located in the oral cavity. Each of the five basic taste 
qualities - sour, salty, sweet, bitter and umami, serves a 
specific function in identifying components found in an 
animal's diet (1,2). Since many toxic plant metabolites 
taste bitter to mammals, bitter taste receptors are 
thought to have evolved to protect the organism against 



the ingestion of poisonous food compounds. Bitter com- 
pounds, surveyed in (3-7), are very diverse in terms of 
their chemical structure and physicochemical properties 
(8,9). 

How can hundreds or more of structurally diverse com- 
pounds be detected by a limited number of receptors is 
an intriguing question. In particular, in humans, bitter- 
taste perception is mediated by 25 G-protein coupled 
receptors (GPCRs) of the hTAS2R gene family (10). 
Characterization of the receptive range of many 
hTAS2Rs (11,12) suggests that, this receptor family 
consists of both broadly tuned and narrowly tuned recep- 
tors. Apparently, individual hTAS2Rs are activated by a 
battery of both structurally related and unrelated bitter 
compounds, while preserving selectivity for chemical 
groups and even stereo-selectivity (11,13,14). The struc- 
tural basis for hTAS2Rs' unique ability to recognize nu- 
merous chemically diverse, and low-affinity (micromolar 
range) agonists is not fully understood. Assignment of 
bitter tastants to individual receptors has been accom- 
plished for some ligands (11,12,14-16) but remains 
unknown for many others. 

Not only taste perception 

Recent reports about expression of taste receptors in non 
gustatory tissues suggest that these proteins may have 
additional functions apart from detection of taste 
(8,17,18). Of the extraoral tissues expressing taste recep- 
tors, the gastrointestinal tract has received much attention 
since mounting evidence indicates that after being ingested, 
bitter tastants might play important regulatory roles in 
digestive and metabolic processes (18). hTAS2Rs were 
also found in human airway smooth muscle tissue and 
the effect of bitter tastants on function of human bronchi 
is currently being studied (19,20). 

Identification and design of modulators 

Studying bitter molecules and designing selective and po- 
tent agonists and antagonists of bitter taste receptors has 
broad applications, both in the field of food sciences and 
for therapeutic indications. To date, antagonist molecules 
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have been found by massive screening (21) and unexpect- 
edly, when studying bitter taste receptor signaling (22). 
Electronic databases of compounds, such as the publicly 
available ZINC database (23) (http://zinc.docking.org/), 
have been extensively used in recent years for computer- 
aided drug design. The need for focused and specialized 
resources of compounds for virtual screening is growing as 
computational methods are being applied to novel research 
fields beyond classical drug discovery. BitterDB is an im- 
portant addition to the recent databases in the field of 
chemical senses, such as sweet tastant molecules database 
(24) and database of volatile compounds relevant to odors 
and scents (25). 

Motivation 

To summarize, taste recognition is important for food 
sensing and possibly plays additional physiological roles. 
Characterizing molecules as bitter and assigning them to a 
specific receptor represents a crucial step towards predic- 
tion and rational modulation of bitterness. Creating a 
database of known bitter compounds, which, in addition 
to associations between bitter molecules and their recep- 
tors, also contains the chemicophysical properties of the 
ligands and mutagenesis data relevant for ligand binding 
[e.g. (26,27)], is an essential step in this direction. 

DATABASE OVERVIEW 

BitterDB overview/description 

BitterDB currently contains more than 550 compounds 
that were cited in the literature as bitter. For each com- 
pound, BitterDB offers information regarding its 
molecular properties, references for the compound's bit- 
terness, including additional information about the bitter- 
ness category of the compound (e.g. a 'bitter-sweet' or 
'slightly bitter' annotation), different compound identifiers 
(smiles, CAS registry number, IUPAC systematic name), 
an indication whether the compound is derived from a 
natural source or is synthetic, a link to the compound's 
PubChem entry (28) and different file formats for down- 
loading (sdf, image, smiles). 

Around 100 bitter compounds have been experimen- 
tally linked to their corresponding human bitter taste re- 
ceptors (12,15,16,29,30). For those compounds, BitterDB 
provides additional information, including links to the 
publications indicating these ligand-receptor interactions, 
the effective concentration for receptor activation and/or 
the EC 50 value and links to the associated bitter taste 
receptors entries in the BitterDB. 

Importantly, the BitterDB database offers its users a 
rich interface for querying and browsing the bitter com- 
pounds dataset. BitterDB is designed to enable the user to 
easily find specific bitter compounds, as well as to perform 
more advanced searches for a range of bitter compounds 
with specific properties. The 'simple search' option allows 
one to retrieve bitter compounds by name, identifiers 
(smiles description, PubChem ID, CAS registry number) 
or a specific associated receptor (Figure 1A). The 
'advanced search' allows one to retrieve compounds that 
fit different criteria, such as a combination of specific 



physical properties (see Example 1 and Figure 1A) or a 
combination of associated human bitter taste receptors. 
The 2D structure similarity search allows one to retrieve 
bitter compounds that are similar to a chemical structure 
query (Example 2 and Figure 1A). In addition to the dif- 
ferent querying options, the user can browse through a 
table with all the BitterDB compounds and their pro- 
perties. The compounds in the table can be sorted accord- 
ing to different criteria (Example 3 and Figure IB). 

BitterDB also contains data about the 25 known human 
bitter taste receptors. Several properties are displayed for 
each receptor: molecular weight, protein sequence, 
genomic location [according to the UCSC genome browser 
(31)], as well as links to the corresponding UCSC Genome 
browser (31) and UniProt (32) entries. For each receptor, 
a list of its known bitter ligands, collected from the litera- 
ture, is displayed with links to the individual ligand entries 
and an option to download ligand structures as a SDF file. 
A 2D protein diagram, which offers an intuitive way of 
looking at the sequence of the receptor and its trans- 
membrane (TM) regions, is displayed for each receptor. 
For several receptors, data regarding mutagenesis experi- 
ments performed on the receptor sequence were found in 
the literature. For these receptors we generated a mutation 
table that summarizes the mutagenesis experiments and 
their effects on receptor function. 

The bitter taste receptors can be searched using different 
criteria: name, known ligands and UniProt accession 
number. Using a table that presents all the bitter taste 
receptors and some information about them, the user 
can browse and also sort by various options, such as the 
number of bitter ligands associated with the receptor. 

In addition, BitterDB offers two extra features: a local 
BLAST service to determine the local similarity between a 
query sequence and the different human bitter taste recep- 
tors (Example 4), and a global alignment of the 25 human 
bitter taste receptors. The alignment was generated using 
ClustalW2 (33), and displays the secondary structure of 
each receptor, as predicted by the TOPCONS server (34). 
In addition, the most conserved residue in each TM helix 
is annotated by its Ballesteros-Weinstein numbering 
scheme (35); the most conserved residue in a given TM 
is assigned the index X.50, where X is the TM number, 
and the remaining residues are numbered relative to this 
position. 

Importantly, BitterDB offers an 'upload' option, by 
which the users are encouraged to submit information 
about bitter compounds. These data will be reviewed by 
the authors and uploaded to the database accordingly. 
The users can join the BitterDB mailing list in order to 
receive database updates. 

MATERIALS AND METHODS 

Data acquisition 

Data were collected from the literature and from various 
web resources. Chemical compounds were included in the 
database only if they were described as bitter in the litera- 
ture, or if they were reported experimentally as capable 
of activating at least one human bitter taste receptor. 
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Figure 1. Screenshots of the BitterDB interface. (A) Illustration of three available search options: (i) Simple search: search compounds using 
keywords, (ii) Structure-based similarity search: 2D similarity search using different structural formats (drawing, smiles and user uploaded structure 
files), (hi) Advanced search, including searching by chemical descriptors, molecule identifiers and by associated human bitter taste receptors. Orphan 
receptors (shown in gray) cannot be searched by ligands. (B) The BitterDB compound browse table (which can be sorted by different criteria). 
(C) An example of a query result page: the user can download an SDF file of selected compounds from the result page. (D) A representative 
compound entry page. 
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The main resources were (numbers in brackets indicate the 
numbers of compounds retrieved): the Merck index (36) 
[300], Fenaroli's Handbook of Flavor Ingredients (37) [94] 
and a recent publication by Meyerhof et al. (12) [98]. Most 
of the 2D chemical structures of the compounds and the 
IUPAC systematic names were taken from PubChem (28). 
A set of physical descriptors such as molecular weight, 
logP, molecular solubility, water solubility, H-bond 
donors, H-bond acceptors and the number of rings, aro- 
matic rings and rotatable bonds were computed for each 
compound using the 'calculate molecular properties proto- 
col' in Discovery Studio 2.5.5 (DS2.5.5; Accelrys, Inc.). 

Data on human bitter taste receptors were taken mainly 
from the UniprotKB database (32). Experimental muta- 
genesis data on the bitter taste receptors were collected 
from the literature. The main resource was Brockhoff 
et al. (26). The secondary structure/topology of the recep- 
tors was predicted by the TOPCONS server (34). 

Structure 

Similarity searches. MarvinSketch, 5.3.8, 2010 
(ChemAxon, Inc.), an advanced chemical editor for 
drawing chemical structures by the user, was implemented 
as the built-in molecular editor. The structure similarity 
search was implemented with Open Babel Package, 
version 2.2.3 http://openbabel.sourceforge.net/ (accessed 
December 2010). The degree of similarity between the 
query molecule and database entries is assessed by the 
Tanimoto coefficient using the MACCS fingerprints. 
MACCS fingerprints are a set of bitstrings; each bit 
refers to the presence or absence of one of the 166 
MACCS keys in a given molecule. The MACCS 



fingerprints were pre-calculated for all the BitterDB mol- 
ecules and are calculated during the search for the query 
structure, in order to compare it to the database entries. 
The Tanimoto coefficient, which is a number between 0 
and 1 (1 corresponding to 'maximum similarity'), is used 
to measure the distance between two molecules (query and 
database entry). The Tanimoto coefficient calculates the 
number of bit positions set to 1 in both fingerprints, 
divided by the number of bit positions set to 1 in at 
least one of the fingerprints. The top 20 results are 
returned. 

Blast. BitterDB contains a local BLAST service, powered 
by the NCBI BLAST package (38). We created a custom 
database of the 25 protein sequences of the human bitter 
taste receptors whose query sequence is searched against. 
The user is able to control the different search parameters, 
such as scoring matrix, gap cost and others. 

Visualization tools j features. BitterDB contains a range of 
visualization tools. These include a 2D compound struc- 
ture display and a 2D image for each compound, which 
are generated using Marvin, 5.3.8, 2010 (ChemAxon 
http://www.chemaxon.com/), a 2D transmembrane protein 
diagram with indication of the most conserved residue in 
each helix and residues for which mutation data is avail- 
able (see Figure 2) for each human bitter receptor, which 
is based on the TOPCONS secondary structure prediction 
(34) and an annotated bitter receptor alignment, which is 
powered by JalView (39). 

Web Server. BitterDB was designed as a relational 
database on a MariaDB server http://mariadb.org/. 




Figure 2. 2D transmembrane protein diagram. A convenient representation of the receptor sequence and its TM regions, shown here for hTAS2R16. 
The seven TM helices are displayed as predicted by Topcons. The most conserved residue in each helix X (BW number X.50) has black background. 
Residues for which mutation data are available are marked with red, and information about the mutations can be found in the site 'Known 
Mutations Table' section below the diagram http://bitterdb.agri.huji.ac.il/bitterdb/Receptor.php7id = 16#ReceptorKnownMutations. 
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The website was built with the PHP server side, and web 
access is enabled via an Apache HTTP Server. The site is 
best viewed in Firefox, Opera, Chrome and Safari 
browsers. 



EXAMPLES 

Examples of BitterDB usage are provided below. A 
movie illustrating each example was recorded from the 
BitterDB website; the movies can be accessed from the 
'Examples' link of the database, http://bitterdb.agri.huji. 
ac.il/bitterdb/examples.php 

Possible usage: filtering molecules using their properties 

Specific Example #1: Searching for bitter molecules that 
satisfy Lipinskis rule of five criteria. It is often useful to 
obtain a subset of molecules having particular physico- 
chemical properties. As an example, we demonstrate how 
one can obtain a subset of bitter molecules that satisfy 
Lipinski's rule of five (40). This rule, based on properties 
of known orally available drugs, states that in general, an 
orally active drug has no more than one violation of the 
following criteria: Not more than 5 H-bond donors or 
10 H-bond acceptors, a molecular mass under 500Da, 
and an octanol-water partition coefficient (logP) of less 
than five (40). In movie #1 (http://bitterdb.agri.huji.ac.il/ 
bitterdb/examples.php#exl), we demonstrate how, using 
Lipinksi's criteria as a filter, we retrieve 83% of the mol- 
ecules in our database. These molecules satisfy Lipniski's 
rule of five and therefore can be considered 'drug-like'. 




1) AristolochicAcid 



0 

ii 

3) Divinyl sulfoxide 



Possible usage: finding bitter compounds using the 2D 
similarity search 

Specific Example #2: Searching for compounds with a 2D 
structure similarity to aristolochic acid. It is commonly 
assumed that chemically similar compounds may bind to 
the same receptor (41-43). However, the chemical features 
of ligands of the same bitter taste receptor are very diverse 
(see Figure 3). It is therefore worthwhile to extend our 
exploration of the chemical space around known ligands 
of bitter taste receptors, in order to gain more information 
about the physicochemical determinants for bitterness. 
Furthermore, since most of the bitter compounds are 
not yet assigned to a particular bitter taste receptor target, 
similarity of a query compound to a ligand with known 
receptor association(s) may provide a helpful clue. 

In Example 2 (http://bitterdb.agri.huji.ac.il/bitterdb/ 
examples. php#ex2) we searched for compounds with a 
2D structure similarity to aristolochic acid, a known 
ligand of the bitter taste receptor hTAS2R14 (as well as 
hTAS2R43 and hTAS2R44) (12). Nine of the top ten re- 
sults were not yet assigned to any bitter receptor target. 
However, result number six, Noscapine, is also a ligand of 
hTAS2R14. This interesting result suggests that additional 
aromatic compounds that bear resemblance to aristo- 
lochic acid could be potential hTAS2R14 ligands. 

Possible usage: Browsing and sorting bitter compounds 

Specific Example #3: Sorting compounds by their logP 
value. Although the structural and physicochemical deter- 
minants for bitterness remain largely unknown, there are 




2) Falcarindiol 




4) Picrotoxinin 



Figure 3. A subset of four hTAS214 ligands demonstrating a high degree of structural variability. These four structurally diverse compounds, which 
are all hTAS214 ligands, demonstrate that hTAS214 ligands lie in a broad chemical spectrum that includes compounds from different classes such as 
aromatic polycyclic compounds (1), polyacetylenes (2), sulfoxides (3) and cycloparaffins (4). 
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some chemical properties within specific chemical families 
that are related to bitterness (44-46). For example, several 
studies proposed a correlation between hydrophobicity 
and bitterness (44,45). Additionally, a recent study 
showed that bitterness of the soy peptides is predominant- 
ly associated with molecular mass (47). 

Hence, it is interesting to sort the dataset of BitterDB 
compounds by different molecular properties and to 
explore the associated ranges of bitterness. In this 
example (ht tp : / /bi tter db . agr i . huj i . ac . il /bi tter db /example s . 
php#ex3), we sorted the dataset according to the 
logP-value, which is one of the means of measuring 
small molecules' hydrophobicity. 

Possible usage: relating bitter taste receptors from 
different species to human bitter receptors 

Specific Example #4: finding human bitter taste receptors 
with sequence similarity to the zebra fish (Danio rerio) 
bitter taste receptor TAS2R5. Organisms' sense of bitter 
taste helps them avoid toxic and harmful substances. 
Although research on bitter taste in humans has 
advanced rapidly, less is known about taste perception 
in other organisms. Thus, it is beneficial to relate novel 
taste receptors to human ones, and learn about potential 
common ligands, relevant known mutations, etc. 

The Zebrafish has only four intact TAS2R genes as 
opposed to 25 genes in humans. In this example (http:// 
bitterdb.agri.huji.ac.il/bitterdb/examples.php#ex4), the 
zebrafish TAS2R5 receptor protein sequence was 
searched against the human bitter taste receptors se- 
quences in BitterDB. The goal was to find the most similar 
human receptors using the BitterDB local BLAST service. 
The human receptor with the highest score for local 
similarity is hTAS2R20 (also known as hTAS2R49). 
hTAS2R20 has two known ligands — diphenidol and 
cromolyn, which, to the best of our knowledge, have not 
yet been tested on zebrafish T2Rs. The second most similar 
receptor, hTAS2R42, is still an orphan receptor, and the 
next one, hTAS2R4, has 15 known ligands. Interestingly, 
one of the hTAS2R4 ligands is denatonium benzoate, 
which was also reported as a zebrafish TAS2R5 ligand 
(48). Detailed studies of ligand-receptor interactions from 
different species are needed to better understand the range 
of activity of the different receptors and its relation to the 
number of receptor types in a given species. Yet, as a first 
guess, the receptor similarity option can be used to 
provide a preliminary list of suspected ligands, based on 
the ligands of human receptors with the highest sequence 
similarity, which were already de-orphanized. 



SUMMARY AND OUTLOOK 

Our goals in the present study were (i) to compile a com- 
prehensive dataset of all chemical structures that have 
been cited in the literature as having bitter taste, and 
(ii) to assign bitter ligands to their cognate receptors in 
humans, where possible. To the best of our knowledge, 
although a proprietary database of bitter compounds 
was established in the past (5), BitterDB is the first 



publicly available, electronically searchable database of 
bitter molecules. 

Interestingly, not only does the receptor receptive range 
vary within bitter taste receptors (12) — also the number of 
functional bitter taste receptors varies among different 
species [e.g. (49)]. The number of receptors in a given 
species may relate to the receptors' receptive range, the 
effects of variation within the population (4) and the en- 
vironmental pressure. The ability to understand and pre- 
dict these effects relies on the knowledge of what is 'bitter' 
(or aversive) to different species, and on the ability to 
cluster ligands based on their chemical similarity. 
BitterDB will be extended in the future to include add- 
itional species, and thus will contribute to elucidating evo- 
lutionary aspects of bitter taste perception. 

Fundamental questions as well as potential therapeutic 
applications of the modulation of bitter taste receptors 
continue to motivate further development of the 
BitterDB database. Future extensions may include imple- 
menting 3D similarity searches of compounds, modeling 
3D structures of the receptors and mapping single 
nucleotide polymorphisms (SNPs) onto these models. 

SUPPLEMENTARY DATA 

Supplementary Data are available at NAR Online: 
Supplementary Files. 
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